A UNIFIED TREATMENT OF DIVIDEND PAYMENT PROBLEMS UNDER FIXED 

COST AND IMPLEMENTATION DELAYS 



ERHAN BAYRAKTAR AND MASAHIKO EGAMI 

Abstract. In this paper we solve the dividend optimization problem for a corporation or a financial insti- 
tution when the managers of the corporation are facing (regulatory) implementation delays. We consider 
several cash reservoir models for the firm including two mean-reverting processes, Ornstein-Uhlenbeck 
and square-root processes. Since the cashflow structure of different companies have different qualitative 
behaviors it makes sense to use different diffusions to model them. We provide a unified mathematical 
framework to analyze all these models and find the optimal barrier strategies. Our solution depends on a 
new characterization of the value function for one-dimensional diffusions and provide easily implementable 
algorithms to find the optimal control and the value function. 



1. Introduction 

In this paper, we solve the dividend optimization problem for a corporation or a financial institution. 
The corporation controls the timing and the amount of dividends and the objective of the corporation 
is to maximize the total discounted dividends paid out to shareholders until the time of bankruptcy 
given that the dividend payments are subject to regulatory delay. The payment of a dividend is not 
automatic and payments can be made only after a certain amount of time elapses. The amount and 
the timing of payment is decided by the company managers but these are subject to the approval of the 
company's owners (shareholders) and maybe also of debt holders and therefore it takes some time before 
the dividends are paid. Recently, there have been other papers on optimally controlling a state variable 
subject to implementation delays in different modeling contexts, see e.g. [2], [3], [4], [16], [21] and [25]. 
Our methodolgy of solving this problem is in the spirit of [4] and differs from the other papers cited above 
as will be made clear below. 

We model the problem of the corporation as an impulse control problem and assume that when dividend 
is paid out, the firm has to pay a fixed cost representing the resources it has to devote to the distribution 
of dividends. This amount is independent of the size of the dividend payment. Other papers modeling 
the dividend payment problem as an impulse control problem are [7], [13] and [23]. There are several 
other papers which model the dividend payment problem as a singular stochastic control problem by 
assuming that there is no fixed cost at the time of dividend payment; see e.g. [10], [11], [12], [13] and 
[26]. 
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Applying an appropriate transformation to the value of a particular control, we transform the problem 
into a non-linear programming problem. Using the new characterization of the value function we give 
an easy to implement algorithm to determine the optimal control and the value function. A secondary 
result of our paper are the sufficient conditions under which the smooth fit holds (see Remark 4.1 and 
Proposition 4.1). In contrast, in the literature impulse control problems are solved first finding a classical 
solution to a system of quasi variational inequalities. The optimal thresholds are determined using the 
so-called "smooth fit principle" (by hypothesizing that the smooth fit holds). (Once a classical solution 
to this system is determined it can be shown to be equal to the value function by the so called verification 
lemma.) See e.g. Bensoussan and Lions [5] and 0ksendal and Sulem [20]. 

In this paper, the time horizon is the time of ruin, and this makes the analysis more difficult from 
that of [4], which only considers infinite horizon problems. Since cashflow of different companies have 
different qualitative behavior, a manager needs a portfolio of tractable models to choose from. Here 
we consider four models for the aggregate income/cash reservoir of the firm: i) Brownian motion with 
drift, ii) Ornstein-Uhlenbeck, iii) Square-root process, iv) Geometric Brownian motion. Most of the 
papers related to stochastic impulse control, in order to obtain analytical solutions, assume that the 
uncontrolled process is a Brownian motion with drift. In addition to using Brownian motion to model 
the cash reservoir, we also propose two mean reverting processes as possible modeling alternatives which 
is suggested by the Cash Flow Hypothesis in Jensen [14]; see [12] for further motivation. On the other 
hand, geometric Brownian motion is used to model the firm value in the structural models in credit 
risk modeling. Our solution for the geometric Brownian motion model can also be interpreted as the 
optimal dividend distribution to the stockholders of a given company since geometric Brownian motion 
is frequently used to model the value of a company (for e.g. in the structural credit risk models) [18]; 
see [10] for further motivation. As far as we know, our paper is the first one that explicitly handles the 
dividend payment problem for for the square root process (with or without delays). 

The rest of the paper is organized as follows: In Section 2, we present the models for the cash reservoir 
and state the dividend payment problem. In Section 3, we provide a characterization of the value function 
for a given threshold strategy. In Section 4, we provide an easily implementable algorithm to find the 
optimal threshold strategy. We also provide theoretical justification for our algorithm in this section (see 
e.g. Proposition 4.1). We then check that the models satisfy the sufficient assumptions of optimality in 
Section 4.3. Finally, in Section 5 we present some numerical examples. 

2. Statement of The Problem 

Let (£l,J-,¥) be a complete probability space with a standard Brownian motion W = {Wt;t > 0}. We 
model the aggregate income process X° as either the Brownian motion 

(2.1) dX% = volt + adW u X% = x > 0, 
for some constants fi, a > 0; or the Ornstein Uhlenbeck process 

(2.2) dX? = -pX^dt + dW t , X$ = x>0, 
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for some constant p > 0, or the square root process 

(2.3) dX® = (1 - 2pX?)dt + 2y[x%dW t , X% = x > 0. 

Note that if the initial condition of (2.3) is properly chosen, then the solution of it is the square of the 
solution of (2.2). We will also consider the case when the aggregate income process follows the geometric 
Brownian motion 

(2.4) dX? = pX?dt + aX\dW u Xl = x > 0. 

The firm will pay dividends to its shareholders out of the aggregate income process X° and the net 
holdings of the firm, i.e. the net income process will be denoted by X. We assume that the company 
pays out dividends to its shareholders in order to maximize the expected value of discounted dividends 
paid out until the time of ruin. There will be a fixed amount of transaction cost for making a dividend 
payment. In this framework a dividend payment scheme that a firm follows can be represented by a 
doubly stochastic sequence 

v = (Ti,T 2 , ....I*....; £1,6$, ...&....), 
where < T\ < T 2 < .... is an increasing sequence of F-stopping times such that Tj +1 — Tj > A, and 
£i, £2— are ^(Ti+A)- measurable random variables representing the dividend amount paid out. The firm 
decides to make dividend payments at (random) time Tj, but it can not act until time T + A (where 
A > is a constant). It decides on the magnitude of the dividend amount at Tj + A depending on the 
level of its revenues. We will in particular consider benchmark strategies. These strategies are determined 
by specifying two numbers < a < b as follows: At the time the aggregate profit (or the firm value) hits 
a large enough level 6, the shareholders ask the firm to commit to making dividend payments and reduce 
the level of net profits (or the firm value) to a. We denote by V the set of strategies that fit into this 
description. We will refer to them as the admissible strategies. 

The net income process follows (until after the first dividend payment) 



(2.5) 



dX t = p(X t )dt + a(X t )dW u < t < Ti + A, 

X^+A = A( Tl+A )_ - £1, 

for appropriate functions p and a depending on which case we are inspecting. For the first three cases 
we assume that is the absorbing state and define tq (the time of ruin) as : 

r = inf{< > : X t < 0}. 

When the aggregate income process follows the geometric Brownian motion, the time of ruin is defined 
as 



(2.6) 



T d ± inf{t > : X t < d}, 



for some fixed d > 0. The purpose of the firm is to maximize expected value of the discounted dividend 
payments until the time of ruin, i.e., 



(2.7) 



r(x) = e 2 



£ e-°^K(X m+A) _,X Ti+A ) 

T»+A<r 
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over all the admissible strategies. We will assume that 



K ( X (Ti+A)-^T 1 +a) — -X"(Ti+A)_ - X Tl +A - A, 



where A > is a fee associated with a transaction. We could also consider 



K ( X (T!+A)-^ x t ± +a) = k • (X( Tl+A )_ - Xt ± +a) - A 



for k 6 (0, 1), in which 1 — k can be considered as the tax rate. This does not affect the analysis and 
therefore we will focus on the case when k = 1. 

Let us denote the value function of this problem by 



When X° is the Ornstein Uhlenbeck process, in addition to considering the performance function in 
(2.7) we will also consider the following performance 



for some constant P > 0. The rationale for considering this penalty function is to penalize declaring 
banktruptcy. As we shall see if the purpose is to maximize the performance function in (2.7), when X° 
follows an OU process, it is optimal to declare bankruptcy when the aggregate income process reaches a 
certain level. Therefore the OU process might be used to model the income process of firms in distress. 
This might give an idea to the creditors of how this type of a firm might behave. The extra cost in (2.9) 
will, on the other hand, deter the firms from declaring bankruptcy. 



In this section, we will show that when we apply a suitable transformation (see 3.6) to the value function 
corresponding to a particular threshold strategy (that is identified by a pair (a, &)), the transformed value 
function is linear on (-F(O), F(b)). This characterization will become important in determining the optimal 



(2.8) 



v(x) = sup J v {x) = J u (x). 



(2.9) 




Ti+A<T 



3. Characterization of the Value Function 



threshold strategy in the next section. Equation (2.7) can be developed as 



J u (x) = E 1 



£ e-<*^K(X (Ti+A) _,X Ti+A ) 

T l +A<r 



1 {T 1 +A<T }{e Q ( Tl+A ) K(X( Tl+A) _, X Tl+A ) 



i>l:Ti+A<r , 



E 1 * 



1 c -a(Ti+A) 



^(X m+A) _,X Tl+A ) + E x ^ e-^+ A )^(X m+A) _,X Ti+A ) 

Ti+A<ro 



l{T 1+ A<r o} e- a(Tl+A) {^(X (Tl+A) _,X Tl+A ) + J U (X Tl+A )}] , 



where we used Ti + \ + A = (T\ + A) + (Tj + A) o #(Ti + A) with the shift operator #(•) in the second 
equality. 

Since T\ = with r& = inf {t > : X® > b} and the post intervention point by 



(3.1) 

It follows that 



^Ti+A — X Tb+A — X( Tb+A y - £i 



a. 



r(x) = E* [l {T6+A < ro} e-«^ +A ) {jf(X (7l+A) _,a) + J»} 



E 2 



E' 1 



_l { r b+ A< T - }^ a(r6+A) {if(X (u+A) _,a) + r{a)} | ^ 
= E* [l {T6<ro} e-^E^ [l {A<To} e- aA {K(X A _, a) + r(a)}]] 
Evaluating J u at x = b, we obtain 
(3.2) r(b) = E b [l {A<TQ} e- aA (K(X A .,a) + J»)] , 

and evaluating at x = we get J u (0) = 0. 



Now, we can write (3.2) as 



(3.3) 



r( x ) 



E* [l {Tb<To} e-^r(b) + l {Tb>To} e-^r(0)) 0<x<b 



^ x [l {A <r }e- aA (K(X A ,a) + J" (a))] x > b. 

Similarly, if the performance function to minimize is the one defined in (2.9), then we have 

E* [\ {Tb<To} e-^r{b) + l {Tb> r 0} e- aT °J"(0)] , < x < b, 



(3.4) 



r x) 



E x [l {A<To} e- aA (K(X A ,a) + J"(a)) - Pl {A>To}e -^] x > b. 
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In this case J v (0) = — P and 

r(b) = E b [l {A<T(>} e- aA (K(X A ,a) + J») - Pl {A >r o} e 



ar 



We will denote the infinitesimal generator of the process X° by A. Let us denote the increasing and 
decreasing solutions of the second-order ordinary differential equation (.A — a)u = by and (p(-) 
respectively (these are uniquely determined up to a multiplication). We can write 

_ ip(l)(f(x) -jj(x)ip(l) aT 1 _ jj(x)(p{r) -ij>{r)ip{x) 

for x G [Z,r] where 77 = inf{t > 0;X® = 1} and r r = inf{i > 0; X t ° = r} (see e.g. Dayanik and Karatzas 
[9]). Let us introduce the increasing function 



F(x) 



ip(x)' 
By defining 

(3.6) W = {J u /ip)oF-\ 
on x G [0, b], using (3.5), equation (3.3) on < x < b becomes 

(3.7) W(F(x)) = WiFib) /^:^ + Ty(F(0)) || = ^ 0<x<b 

which shows that the value function is linear in the transformed space. Next, we will compute 

(3.8) B 4 E* [l {A<ro} e-" A (^(X A , a) + J»(a))] , 

in (3.3) for all the different models of aggregate income process. (In the case of geometric Brownian 
motion we will replace To by in (3.8). Moreover the function J u {x) for this case is given by replacing 
0's with d's in (3.4).) 

3.1. Computation of B in (3.8). 

3.1.1. Ornstein-Uhlenbeck Process. Let's first consider the case when X° is the Ornstein-Uhlenbeck pro- 
cess given by (2.2). Recall that X? can be written as (can be derived using Theorem 4.6 of Karatzas and 
Shereve [15]) 

1 _ e -2pt 

(3.9) X t ° = xe~ pt +B Q u), where Q(t) = , or 



e 



2p 

2pt 



(3.10) e pt X° t =x + B Q{ty where Q{t) - — , 

and B and B are Brownian motions. This implies that the distribution of X® is Gsn (xe~ at , Q(t)j. (We 
use Gsn(a, b) to denote a Gaussian random variable with mean a and variance b.) As a result of the 
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representation in (3.10) 

Q /*OC 

¥ x (t > A) = P a '(^ > 0(A)) = 1 - -= / e- u2 / 2 du 

V2lT Jx/\/ Q(A) 

(3.11) 

= 2 AT ( 



where Tg 8 is the first time the Brownian motion x + B hits zero. Here, we used the distribution of the 
hitting times of Brownian motion (see page 96 of Karatzas and Shreve) . We also used the notation that 
N{x) = Zoo V(V2^)e- u2/2 dn. 

Let us try to identify the density function of Y A — X A l^ TQ>A y. To this end we first compute 

(3.12) 

F X {X° A > y, to > A} = F X {X° A >y)- F X (X° A > y, r < A} 

= F X {X° A >y}- F X {X° A < -y, r < A} 

= W x {X A >y}-F x {X A <-y} 

1 (T ( {u-xe~P A ) 2 \ , f-y ( (u-xe-P A ) 2 \ \ 
exp du — exp au . 



V2vrQ(A) V/ y V 2Q(A) ; ./-oo V 2Q(A) 

Here, the second equality follows from the fact that OU process satisfies a reflection principle around 
zero, and the third inequality follows from the fact that {X A < — y} D {tq < A} since y > 0. The last 
line implies that (after taking the derivative with respect to y and flipping the sign) the density of the 
random variable Y A = X A l^ T0>A y is given by 

(313> g(y)= VmmV XP V- 2Q(A) ]—"(- 2«(A) JJ- »> ft 

Using (3.11) and (3.13), we can write 
(3.14) 

B = E x [l {A<TQ} e- aA (K(X A ,a) + J»)] = e~ aA ^ (^2N \^^J ~ ^ O) - a - A) + a) , 
in which 

l-OO 

A= yq(y)dy. 
Jo 

Since 

9<!XP I" 20(A) j * = - /_„,-„. 9<JXP (~ 2Q(A) j * 
we can write A as 

1 / /"» / fo-ie-^N [°° ( (y + xe-*? \ 

= vmm 11- * p r ^ J * - L- 96X13 r «w J * 

For any /i£M and cr > we have that 



As a result 

(3.15) A = xe'P A . 

Observe from the above calculations that X A l{ T0>A y and X A have the same expectation. 
We will also compute the quantity 

(3.16) B 4 E x [l {A<TQ} e~ aA (K(X A ,a) + J" (a)) - Pl {A>To} e~ a ^] , 

for this case. Using the density of the hitting time of 0, which can be derived by differentiating (3.11) we 
can write 

(3 - 17) B — B - Pfe--^ (-A^) ^ exp (-J^gl + §) * 

There is not explicit expression available for the integral term (even in terms of special functions, except 
when A = oo, see e.g. [6] and [8], in which case this integral is the Laplace transform of the distribution 
of ro) but the NIntegrate function of Mathematica is able to evaluate it with a very high numerical 
precision. 

Remark 3.1. We can compute B in (3.8) explicitly even for the cases when X° follows 
(3.18) dX? = {4>- pX^)dt + adWt, X = x>0, 

for (f>, a > by using the Strong Markov property to compute 

E x [X A l {A<To} ] = E X [X A ] -E x [l {A > To} X A ]. 
The Strong Markov property is used to compute 

E*[l {A > To} X A ] = E*[l {A > To} E*[A A |^ T(J ]] =E*[1 {a > to} E°[Aa„ to ]] 

= / f(u)E°[X A . u ] = <j> [ f(u)(l - exp(-p(A - u)))du, 
Jo Jo 

where f is the density function of tq. Several representations for f are available, see for e.g. [1]. 

3.1.2. Square-root Process. To evaluate B in (3.8) when the aggregate income process is modeled by the 
square root process in (2.3) we need to compute 

oo 

2^/ 



(3.19) C± / y 2 q(y)dy, 

Jo 

in which q(y) is equal to the q(y) in (3.13) if x is replaced by y/x. This follows because if (2.2) is started 
from \fx, then the solution of it is the square root of the solution of (2.3). Let us first evaluate 

(3.20) 

- 1 r y 2 cxp (- {V ~ ^ £ A 7 A)2 1 dy = -L f° (vVOi^j + V^ A ) 2 cxp ) dy 

v/27rQ(A) J y P V 2Q(A) J J-^t V 

- g<A V:^» 2 -(4)^^^/^,« P (4),„ + ^(^) 

. IrTl . /Q(A) ' / 



Q(A) [°° 2 / y 2 \, 2VQ(A) _ pA ( xe~^ \ _ 2pi 

/ y exp ay H ■ — v^ e exp — — + xe ^ TV — . 

i- ^ff V 2; V2T P V 2Q(A); Vv 7 ^)/ 



/2tt 



Since , 1 / °° y 2 exp (— ' ) ^ can ^ e obtained by nipping the sign in front of y/x in (3.20), 



V 27r( 3( A ) 

the computation of C will follow. We also have that 



(3.21) 



2 Vl l\/Q(A)j n/S 2Q(A) ^ 



Form (3.20) and (3.21), we can evaluate C as 



(3.22) C= (xe- 2pA + Q(A)) 



2N 



V7g(A); 

When X° is the square root process then B defined in (3.8) equals 

(3.23) B = e-(( 2 A r (^5)-l)(,»- a -A) +C ). 

3.1.3. Brownian Motion with Drift. Similarly, using reflection principle, Girsanov's Theorem and the 
spatial homogeneity of Brownian motion we will obtain B in (3.14) when X° is a Brownian motion given 
by (2.1). We will first need the following lemma, which is Corollary B.3.4 in [19]. 

Lemma 3.1. Let Yj = aWt + /Ui and mj = min ue \o£ Y u . Then 

(3.24) F{Y t > V, ml > m} = N (^jf) ~ e 2 ^ N ( ^^) , 
for every m < and y > m. 

We can write B as 

(3.25) B = E x+e [l {A <r 9 }^A] , 

in which 6 = J h '(a) — a — A, which follows from the spatial homogeneity of Brownian motion. Note that 
for any y > 9 

Z = l{A<r e }^A > V iff X A >y,m%°>6. 
We will find the probability density function of Z. Let us first define 



Y t ±X t -(0 + x) 



which implies that mj = mf° — (9 + x). With this new definition 
(3.26) 

F{Z > y} = HYa > y-(x + 9), m l >-x} = N ( " y + ^ +MA ) -e-^N (zEZJL^J^j 
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Here, the second equality follows from Lemma 3.1. Now, the density of the random variable Z is easy to 
calculate and using that we can compute B by calculating the expectation of Z and get 

l(x + fiA) 2 ' 



B = e 



-aA 



(x + /iA + J u {a) - a- X)N 



(3.27) 



-2fj,x/cr 2 



ax/A 

(-x + fiA + r(a) - a - \)N 



x + uA\ a\/A 

H 7^ exp 



2tt 



x + uA\ ay/A 

H 1= exp 



(T\/A 



2 ct 2 A 

1 (-a; + /iA) 1 



2?r 



<t 2 A 



3.1.4. Geometric Brownian Motion. We will use the down and out European call option price, see e.g. 
[24] , (when we take the strike price to be zero) to evaluate 



(3.28) 

in which 
(3.29) 



IE* [Wa } *a] = e^x N{d{) - - 



l+2n/a 2 



N(-d 2 



log § + Gu - ±a 2 )A 



do 



X J 

logf-(^-ia 2 )A 



ay/A ay/A 
In order to calculate B we also need to compute W{T d > A}. In fact 



¥ x {T d >A}= P x {rf > A}, 
in which t? is the hitting time of d < by the Brownian motion B = jt + aBt , where 



d = loe 



1 = H~^ • 



Using the hitting time distribution for Brownian motion (with drift), (which can be obtained from 
Lemma 3.1), we deduce 



(3.30) P x {r d > A} = P x {m% >d} = N 

Therefore, B can be written as 



-d + jt 



aVA 



exp 



2jd 
~a^ 



N 



d + jt 



a\/A 



(3.31) 



B = e 



-aA 



e^x N(di) 



1+2^/a 2 



N(-d 2 ) + (J» -a- X)F x {T d > A} 



4. An Efficient Algorithm to Calculate the Value Function 

4.1. Increasing and Decreasing Solutions of (A — a)u = 0. When X° is the Brownian motion in 
(2.1), then the increasing and decreasing solutions of (A — a)u = are 

(4.1) if>(x) = e DlX and <p(x) = e° 2X 

in which 

-// + \J ix 2 + 2acr 2 -/x - yV + 2acr 2 

£>l = and D 2 = • 

a z a A 

When X° is the Ornstein-Uhlenbeck process in (2.2), then 



(4.2) 



ip(x) = exp 



(px^ \ f px^ \ 

— J D_ a/p (-xy/2p), (p(x)=expl—jD_ a/p (xy/2p), x£ 
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where D v (-) is the parabolic cylinder function given in the Appendices 1.14 and 2.9 in [6] which is defined 
as 

D v {x)±2- V l*e-*I*H V (^\ ) xGR, 
in which H u is the Hermite polynomial of order u, which has the integral representation (see e.g. [17]) 

H v {x) = — r / exp {-t 2 - 2tx) t~ v ~ l dt, Re(i/) < 0. 

*-\— v ) Jo 

On the other hand, when X° is the square root process whose dynamics follows (2.3), then 
(4.3) i/>(x) = x~ 1 / 4 exp(?P\M_^, i _i(px), <p(x) = aT 1/4 exp ( ^ )w_*. , i _i(px), 

V 2 / 2p^4'4 \ 2 / 2p~ l ~4' 4 

in which W_ <* , i _i and M_a_ , i _i are Whittaker functions (see e.g. Appendix 2.10 of [61). These 

2p + 4' 4 2p + 4> 4 

functions satisfy 



(4.4) 



W_ a + l _1 (y) = 2^ ^D_ a/p (x), X > 0, 

M_ „ (y) = r((1 ^ P)/2) v^(^- a /p(-^) " D-a/pi*))> ^ > 0, 



in which T stands for the Gamma function F(x) = f °° u x 1 e u du. When, X° is the geometric Brownian 
motion, then 



(4.5) iih(x) = xv + ^ K (p(x) = x v + ^ K , 
in which k = fi/a 2 — 1/2. 

4.2. An Algorithm to Find the Optimal Control. In this section we will describe a numerical algo- 
rithm to find the value function. First we will introduce some notation that we facilitate our description. 

(4.6) e aA B =: r(x;a) + h(x)J u (a), 
where B is as in (3.8). We transform r and h by 

Note that r(o;o) < and that sup x r(x; a) > in all the cases considered above (see Section 4.3). 
First stage: For a given pair (a, b) S M. 2 , we will determine W in (3.6) using the linear characterization in 

(3.7) . On [F(0), F(b)\ we will find the line W(y) = 0y + £ that passes through the point (f(0), 
i.e., 

(4.8) { = -mo) - 

and satisfies 

(4.9) 0F(b) + £ = e~ aA [R(F(b); a) + H{F(b))^(a)(f3F(a) + £)] . 
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P = when we consider (3.3). The slope (3 can be determined as 



(4.10) 



P , -aA 

v(o) + e 



fl(i?(5);o)-fi(^(6)) V (o)^ 



- F(a) - e- QA (F(a) - F(0))H(F(b))ip(a) ' 
Now the function J u (x) in (3.4) can be written as 

(3tp(x) +£cp(x), 0<x<b, 



(4.11) 



r( x ) 



-aA 



(r(x;a) + h(x)J v (a)), x > b. 



Note that (A - a)J v {x) = for x < b. 

Second stage: Let us fix a and treat (3 as a function of 6 parametrized by a. We will maximize the 
function (3 in (4.10). Taking the derivative of (4.9) and evaluating at (3b = we obtain 

P 



(4.12) 







-aA 



+ H'(F(b)Ma)[p-(F(a)-F(0)) 



<p(0) 



y=F(b) 

in which (3 is as in (4.10). To find the optimal b given a we solve the non- linear and implicit equation 

(4.12) . 

Remark 4.1. On y > -F(fe), the function W is given by 

(4.13) W(y) = e~ aA [W(F(a))<p(a)H(y) + R(y; a)} . 
The right derivative of W at F(b) is 



(4.14) 



W(F(b)) 



-aA 



W(F(a)Ma)H'(F(b)) + ^R(y;a) 



y=F(b)/ 

where we used (4-12). This implies that the left and the right derivatives ofW are equal at F(b) (smooth 
fit), since the left derivative at F(b) is also equal to (3. 

Third stage: Now, we vary a E M+ and choose a* that maximizes a — > /3(a). We also find the corre- 
sponding b* = b(a*). Now, the value function is given by (4.11) when a and b are replaced by a* and 6* 
respectively. 

The next proposition justifies the second stage of our algorithm. 

Proposition 4.1. Assume that r(a;a) < and sup x r(x;a) > 0. Furthermore, if the functions R(-;a) 
and H(-) defined in (4-7) are increasing and concave on (y,oo) for some y > F(a), and the function h(-) 
defined in (4-6) satisfies h(-) £ (0,1), then for any given a > 0, (4-12) has a unique solution. 

The proof essentially follows from Remark 4.1. But we will have to introduce a series of lemmas before 
we justify our claim. 

First, let us also introduce a family of value functions parameterized by 7 G R as 

(4.15) V^(x) 4 supE* [e- ar r">(X T ;a)] where r~<{x;a) = e - aA (r(x; a) + 7 • h(x)), 

in which S is the set of stopping times of the natural filtration of X. Here, X is a diffusion on [0, 00), 
which is absorbed at the left boundary. (In the case of geometric Brownian motion this left boundary is 
taken to be d > 0.) Then we have the following result. 
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Lemma 4.2. Let us define 

(4.16) ^(-;a) A r7(F_1( ' ),a) 



then the function 

(4.17) W2{- 



is the smallest non-negative concave majorant of W that passes through (.F(0),0). Moreover under the 
assumptions of Proposition J^.l this majorant is linear in the continuation region (the region in which 
W 1 is strictly greater than R? ). 

Proof. The first part of the proof follows Proposition 5.3 of Dayanik and Karatzas [9]. The second part 
of the proof follows from the first and the fact that i? 7 (-;a) is increasing and concave on (y, oo). □ 

The following technical lemma will be used in showing the existence of 7 such that Va(a) = 7 for any 
a > 0. 

Lemma 4.3. 

(4.18) V a ^ 1 (x) — V^ 2 {x) < 71 — 72, for any 71 > 72 and x > 0. 

Proof. It is clear from (4.6) that 7 — ► Va(x) is an increasing convex function. Therefore the right- 
derivative 

DJV a ' (x) = hm 

7 hio h 

exists for any 7' > and it satisfies 

(4.19) K 7l (x)-K 72 (x) ^ jy+jrn^ 

71-72 

for any 71 > 72 (see e.g. [15], pages 213-214). Note that since h(-) £ (0, 1) we have that 

(4.20) < D+V^\x) < 1. 

Now, (4.19) and (4.20) together imply (4.18). □ 

Lemma 4.4. Under the assumptions of Proposition 4-1, there exists a unique 7 such that Va(a) = 7 for 
a > 0. 

Proof. Consider the function 7 — ► Va(a). Our aim is to show that there exists a fixed point to this 
function. Let us consider V a °(a) first. Since sup x r(x;a) > we have that V®(a) > 0. Now let us 
consider the case when 7 > 0. First, note that W2(F(a)) > W(F(a),a) for all 7. Since by Lemma 4.3 
V has less than linear growth in 7 and W is linear in 7, we can find a 7' large enough such that 
Wa(F(a)) = i? 7 (F(a),a) for 7 > 7 . This implies however 

V/(a) = ip{a)Ry'{F(a);a) = e^ aA (r(a;a) + 7 7i(a)) < j'. 

Since 7 — ► Va is continuous, which follows from the fact that this function is convex, V° > and 
Va («) < 7 implies that 7 — ► Va crosses the line 7^7. Since 7 — ► Va is increasing convex it crosses 
this line only once. □ 
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Proof of Proposition 4.1. The smallest concave majorant W2 in (4.17) is linear on (^(0), F(6 7 )) for a 
unique 6 7 S [0, oo) and smoothly fits to i? 7 (-; a) at V 1 and coincides with -R 7 (-, a) on [6 7 , d). Together with 
Lemma 4.4 this implies that there exists a unique 7* such that equations (4.13) and (4.14) are satisfied 
when W is replaced by W2 and b is replaced by b 1 * . If the solution of equations (4.13) and (4.14) were 
not unique, on the other hand, then one would be able to find multiple smooth fit points 6 7 , which yields 
a contradiction. 

4.3. Are the Assumptions of Proposition 4.1 Satisfied? The following remark will be helpful in 
the analysis that follows: 



Remark 4.2. Given a function k let us denote K(y) = — o F 1 {y), y > 0. If k is twice differentiate at 

m'(x) -.I 
— With, 



A k 

<P 

x > and if we denote y = F(x), then K'(y) = m(x) and K"(y) = 

(4.21) m ( x )=*_ (T) ( x ) a nd K"{y)[{A - a)k(x)} > 0, y = F(x), 

with strict inequality if H"(y) 7^ 0. The inequality in (4-21) is useful in identifying the concavity of K . 
4.3.1. Brownian Motion with Drift. In this case r(x;a) and h(x) defined in (4.6) are given by 

r(x; a) = (x + //A — a- X)N (^=^) + °^<t> (^^) 

,, 22) - ( ( _ x + ^l x)N (z£+£ ) + ^ , 

in which 0(x) = (l/ v / 2^)e- x2 / 2 . 

First note that h(x) € (0, 1). It is enough to show that R(-;a) and H(-) are eventually increasing, and 
are eventually concave. First, we will show that they are eventually increasing. The derivative of R(-; a) 
has the same sign as 

2 /r(x;a)\' j { x + /j,A\ (a + A) / x + /j,A\ 



+ e -2^ ( N + _ (« + A)^ f-x + »AY 

(4 23) V V o-Va J o-Va V o-Va ), 

since F is an increasing function. If we take x > a (a is fixed) large enough, the third line of (4.23) 
dominates the the other lines. Since D\ > 0, we can conclude that there exists x' > a such that 
(^)'(s) >0onx<E (x'.oo). 
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e 



On the other hand, directly taking the derivative, h{x) can be shown to be an increasing function in 
x G M+, from which it follows that H(y) = h(F~ l (y)) / 'ip{F~ l (y)) is also increasing. 

Next, we will show that R and H are eventually concave. Consider the equation {A—a)r{x; a) = p(x; a) 
so that 

p(x; a) = /j,r'(x; a) + -a 2 r"(x; a) — ar(x; a). 

Directly taking the derivatives and letting x — > oo, we obtain r'(x;a) — > 1, r"(x; a) — > and r(x; a) — > oo. 
Therefore lim x ^ co p(x; a) = — oo. Similarly, we consider the equation q(x) = (A — a)h(x). By letting 
x — ► oo, we have h{x) — > 1, — > and h"(x) — > so that lim^^oo q(x) < 0. Together with Remark 4.2, 
these facts imply that R(-,a) and #(■) are concave on y G (y",+oo) for some y"F(a). 

4.3.2. Ornstein-Uhlenbeck Process. We will only consider the case when the performance function is as 
in (3.3). The analysis for the case when declaring banktruptcy is penalized can be performed similarly, 
since first and the second derivatives of the integral term in (3.17) with respect to the x variable goes to 
zero as x — > oo. 

In this case r(x;a) and h{x) defined in (4.6) are given by 

(4.24) r(x; a) = xe~ pA — ( 2N [ ^ = ] - 1 | (a + A), h(x) =2N\ X € = | - 1. 

First, observe that r(x; a) > on (xi, oo) with some x\ > a. By taking the derivative of r(x; a), we have 

^ s/QiE) \VQiA))) 

From this expression we see that r'(x;a) > on x G (x2,oo) with some X2 > a. Let us denote x' = 
max(xi,X2)- It follows that R{y) is increasing on y G (y',oo) with y' = F{x') because 

\ ' r > — r ' 

- ) = k with ip' < 0. 

Observe also that h(x) G (0, 1) and h'{x) > on x G E+. 

Next, we will analyze the concavity properties of R(-;a) and H. Consider the equation (A — a)r(x; a) = 
p(x; a) so that 

p(x; a) = —pxr'(x\ a) + -r"(x; a) — ar(x; a). 

We have r(x;a) — ► +oo, xr'(x;a) — > +oo and r"(x;a) — > as x — ► oo. Thus, we have lim a; ^ 00 p(x; a) = 
— oo. Similarly, we consider the equation g(x) = (^4 — a)/t(x). By letting x — > oo, we have h(x) — ► 1, 
xh'(x) — > and /i"(x) — > so that lirn^oo g(x) < 0. Together with Remark 4.2, this analysis shows that 
there exists y" > -F(a) such that fi(-; a) and (•) are concave on (y", oo). 

4.3.3. Square Root Process. In this case the functions r and h are given by 

a) _ („-^ + Q(A) - (a + A)) ( 2 iV (^J) - l) + 2,^.^ (^J) . 

h(x)=2N I ^ 9 1 -1. 



r (x; a) 
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Observe that r(x; a) > on (xi, oo) with some x\ > a since the only negative term in the first equation 
in (4.25) is bounded from below by — (a + A). Taking the derivative of r(x; a) we obtain 

The second term on the first line of (4.26) is positive and it dominates as x — > oo, therefore r'(x; a) > 
on x G (a?2,oo) with for some X2 > a. Take x' = max(xi, x%). It follows that R(y) is increasing on 

y G (y', 00), in which y' = -F(x'). On the other hand, h{x) G (0, 1) and h'(x) = ~^-4> ( ^^^) < °- 

However, h' goes to zero as x — > 00, which implies that (h/ip)' = h ^^I 1 ^ > on (x", 00) for some 
sufficiently large x". 

Next, we analyze the concavity properties of R(-; a) and H(-). Let us define p(x; a) = (A — a)r(x; a). 
As a result 

p(x; a) = (1 — 2px)r'(x; a) + 2xr"(x; a) — ar(x; a). 

We have r(x; a) — ► +00, xr'(x; a) — ► +00 and xr"(x; a) — ► as x — ► 00. Thus, we have lim x ^ 00 p(x; a) = 
—00. Similarly, we consider the equation q(x) = (A — a)h(x). By letting x — > 00, we have h(x) — ► 1, 
xh(x) — ► and xh"{x) — ► so that lim^oo g(x) < 0. Using Remark 4.2, we observe that R(-;a) and 
if (•) are eventually concave. 

4.3.4. Geometric Brownian Motion. When the aggregate income process X° is modeled by a geometric 
Brownian motion a sufficient condition for the hypothesis of the Proposition 4.1 to hold is \i < a. In this 
case the functions r and h are given by 

(4.27) 

r „; .) - ^ - (|) - (a + A) (* - (^) ) , 

Observe that d < since x > d. Moreover, N(d±) — ► 1, N(—d2) — ► and iV(d) — ► as x — ► +00. Also, 
r(x; a) > on (xi, 00) with some xi > a since the negative term in the first equation in (4.27) is bounded. 
On the other hand h{x) > for x G K + and h!(x) — > as x — > 00. The derivative of r is 

/ / ,x l+2/Vcr 2 \ 

r'(x; a) = e^ A JV(di) - ^ iV(-d 2 ) - (a + X)h'(x) 



o-\J~K \x J V o-\fK J \x 
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which is positive on x G (x2, oo) for some X2 > a. Take x' = max(xi, X2). It follows that R(y) is increasing 
on y G (y',00) with y' = F(x'). Similarly, since n(x) G (0, 1) and /i'(a;) goes to zero as x — > 00, so that 
H'{y) > on (y",oo) for sufficiently large y". 

Next, we analyze the concavity of R(-;a) and H(-). Let us denote p(x;a) = (A — a)r(x;a). The 
function p(-;a) is given by 



lim a .^ +00 p(x; a) = —00 when fj, < a. Similarly, we consider the equation q(x) = (A — a)h(x). h(x) — ► 1, 
h'(x) — > and h"{x) — > implies that lirn^oo q(x) < 0. Using Remark 4.2, we can conclude that R(-; a) 
and H(-) are eventually concave. 



See Figures 1-4 for numerical illustrations. In our examples we quantify the effect of delay in dividend 
payments. In each case we find the optimal dividend payment barrier, b* , the optimal amount of dividend 
payment, b* — a*, and the value function v. Then we compare them to 6°, b° — a and v°, the analogues 
of the previous quantities when there is no delay. As expected the value function is smaller, v < Vq when 
there is delay in dividend payments. Since in Figures 2 (b), 2 (e) and 4 (b), the value functions v and vq 
are not distinguishable, in Figures (2) and (4) we plot the difference of vq — v. 

When the aggregate income process, X° is modeled by a Brownian motion with drift, a square root 
process then we observe that a* < clq, b* < b^, b* — a* < b^ — ao and j3* < (3$. The same conclusion holds 
if X° is an Ornstein-Uhlenbeck process and the declaring bankruptcy is penalized. On the other hand, 
when X° is modeled by an Ornstein-Uhlenbeck process (the case in which declaring ruin is not penalized) 
or a geometric Brownian motion we obtain that a* = ao, b* > bo, b* — a* > 60 — a o an d /3* < (3q. Note 
that in both of these cases declaring bankruptcy is optimal as soon as the aggregate income level hits b* , 
regardless of the magnitude of delay. 

Observe that in the numerical examples considered, the function /3(a), which is obtained from (4.10) 
after we plug in for b that we obtain from (4.12) (say b(a)), is concave. It is either strictly decreasing 
or has a unique local maximum. We leave the proof of these features of the function /3(a) as an open 
problem. 

Remark 5.1. In our framework, it is easy to deal with solvency constraints. The optimal a* may not 
be acceptable, and prohibited by regulatory constraints. This was studied by Paulsen [22] in singular 
control setting (with no delays). Let as consider the case with A = and assume that the firm is not 
allowed to reduce its aggregate cash flow to below a. If we show the above properties hold for /3(a) a it 
is easy to argue if a* > a, then every time it pays out dividends the firm would reduce its reservoir to 
a* (the constraint is not binding), else if a* < a, then the firm every time it pays out dividends the firm 
would reduce its reservoir to a. 




where T(x; a) is the terms that involve 




Observe that 



5. Numerical Examples 
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/3(a) v(x) 




(a) (b) 

FIGURE 1 . A numerical example of a Brownian motion with drift with parameters (/i, a, a, A, A) = 
(0.3, 0.15, a/2, 0.1, 0.25): (a) The graph of (3(a) that attains the global maximum at a* = 0.755 with 
(3* = 1.443. (b) The value function v(x) (below) with b* — 2.719. It is compared with the case of 
A = (above) with (a , b , /3 )=(0.850, 2.895, 1.466). 



6. Conclusion 

We study optimal dividend payout problems with delay using various types of diffusions. Our method 
facilitates greatly the solution procedure due to the new characterization of the value function. The exis- 
tence of the finite value function and the uniqueness of optimal threshold strategy reduce to verifications 
of the assumption of Proposition 4.1. Our models here are more realistic since the delays with respect to 
dividend payments are explicitly handled. 
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FIGURE 2. A numerical example of an OU process with parameters (p, a, A, A) = 
(0.01,0.05,0.2,0.25): (a) The graph of (3(a) that attains the global maximum at a* = with 
(3* = 8.706. (b) The value function v(x) (below) with b* = 1.785. It is compared with the case 
v (x) of A = (above) with (a , bo, 0o)=(0, 1.783, 8.841). (c) Plot of the difference v (x) - v(x). 
(d) In the case of penalty at ruin, P = 10, the graph of /3(a) that attains the global maximum at 
a* = 4.290 with (3* = 0.953. (b) The value function v(x) (below) with b* = 6.811. It is compared 
with the case v (x) of A = (above) with (a ,b ,/3 ) = (4.349,7.141,0.979). (c) Plot of the 
difference vq(x) — v(x). 
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FIGURE 3. A numerical example of a square root process with parameters (p, a, A, A) = 
(1,0.1,0.1,0.25): (a) The graph of /3(a) that attains the global maximum at a* = 0.09 with 
(3* = 2.807. ( b) The value function v(x) (below) with b* = 0.662. It is compared with the case of 
A = (above) with (a , b , /3 )=(0.165, 1.014, 3.561). 
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FIGURE 4. A numerical example of an geometric Brownian motion with parameters 
(/i, cr, a, A, A) = (0.05, V2, 0.1, 0.1, 0.25) and the ruin level d = 1: (a) The graph of /3(a) that 
attains the global maximum at a* — 1 with /3* = 0.853. (b) The value function v(x) (below) with 
b* = 9.138. It is compared with the case of A = (above) with (oq, &o> A)) = (l> 7.318, 0.865). (c) 
Plot of the difference Vq(x) — v(x). 
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